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RELATED APPLICATIONS 



This application claims the benefit of Tabor and Richardson, U.S. Provisional 
Application 60/1 15,498 filed January 11, 1999, entitled ISOTHERMAL 
AMPLIFICATION OF DNA, which is hereby incorporated by reference in its entirety, 
including drawings. 



FIELD OF THE INVENTION 



The present invention relates to the field of amplification of DNA and RNA. 



BACKGROUND OF THE INVENTION 

This invention was made with government support including a grant fi-om the 
U.S. Dept. of Energy, contract number DE-FG02-96ER62251. The U.S. government 
may have certain rights in the invention. 

The following is a discussion of the relevant art, none of which is admitted to be 
prior art to the appended claims. 

A variety of methods have been developed to efficiently ampUfy DNA using 
purified proteins. These generally can be classified as ones that use thermocycling of the 
reaction mixtures and ones in which the reactions are isothermal. Methods that use 
thermocycling of the reaction mixture are based on the polymerase chain reaction, or 
PCR (Saiki et al. Science 230:1350, 1985). In PGR, two primers are added to the target 
DNA, such that the two primers are complementary to opposite strands of the target 
sequence with their 3' ends oriented towards one another. Multiple cycles of 
denaturation of target DNA, annealing of the two primers, and then extension of the 
primers with a DNA polymerase, results in an exponential synthesis of the target DNA 
sequence located between the two primers. This procedure requires the use of a DNA 
polymerase that is thermostable in order to survive the high temperatures required toRECEIVEC 
denature the product DNA each cycle. The most common polymerase used for PCR ji|Qy \ 4 2003 



OFFICE OF PEimONS 



249/107 
Patent 



from Thermus aquaticus, or Taq DNA polymerase (Saiki et al. Science 239:487, 1988). 
Modification of the reaction conditions and the enzyme mixture used can allow the 
amplification of DNA up to 50 kbp in length (Barnes, Proc. Natl. Acad. Sci. USA 
91:2216, 1994 and Cheng et aL, Proc. Natl Acad. Sci. USA 91:5695, 1994). 

5 Isothermal procedures for the amplification of nucleic acid include (1) QP 

replicase, (2) self-sustained sequence replication (3SR), (3) strand displacement 
amplification (SDA), (4) terminal protein-primed DNA amplification using 029 DNA 
polymerase, and (5) rolling circle amplification (RCA). 

The RNA-dependent RNA polymerase QP polymerase has been used for gene 
10 detection in a strategy that uses two probes that hybridize to adjacent positions on a 
target sequence that are then ligated to form an amplifiable reporter RNA (Tyagi et al.y 
Proc, Natl Acad. Sci. USA 93:5395, 1996). 

Self-sustained sequence replication (3SR) is a method of amplifying RNA that 
depends on the action of three enzymes; reverse transcriptase, DNA-dependent RNA 
1 5 polymerase and ribonuclease H (Guatelli et al , Proc. Natl. Acad. Sci. USA 87: 1 874, 
1990). 

Strand displacement amplification (SDA) is a DNA amplification system that 
uses a restriction enzyme to introduce specific nicks in a target to be amplified and a 
DNA polymerase that is capable of strand displacement synthesis (Walker et a/., Proc. 
20 Natl. Acad. Sci. USA 89:392, 1992). 

Terminal protein-primed DNA amplification exponentially amplifies linear 029 
DNA using the 029 proteins DNA polymerase, terminal protein, double-stranded DNA 
binding protein, and single-stranded DNA binding protein (Blanco et al, Proc. Natl. 
Acad. Sci. USA 91:12198, 1994). To date this method has only been successfiil using 
25 linear 029 DNA as template. 

Rolling circle amplification (RCA) can either be linear or exponential 
ampUfication of circular oligonucleotide probes that use two primers that anneal to each 
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of the two strands, and a DNA polymerase that strand displaces (Lizardi et al. Nature 
Qenetics 19:225, 1998 andZhangee/fl/.,fiene211:277, 1998). 

In order to be used as a generalized ampHfication system, all of these methods 
require the addition of specific oligonucleotide primers that are extended by a 
polymerase; the primers serve to fix the limits of the segment of nucleic acid to be 
amplified. . 

SUMMARY OF THE INVENTTON 

The present invention provides an in vitro method to amplify DNA exponentially 
at a constant temperature and without the use of exogenously added primers by using a 
DNA polymerase and various accessory proteins. The accessory proteins function with 
the DNA polymerase to carry out amplification under these conditions. 

In one aspect, the method involves the addition of a mixture of enzymes 
including a DNA polymerase, a helicase and a primajse and single-stranded binding 
protein to template DNA, which can result in amplification of the template DNA of at 
least 10-fold and up to and greater than 1 million-fold amplification. Such an 
amplification system is useful for common molecular biology procedures, including but 
not limited to, the preparation of plasmid DNA from cells for DNA sequencing, cloning, 
providing substrates for transcription/translation reactions, and mutagenesis. In addition, 
this generalized system for DNA amplification can serve as a useful way to prepare large 
amounts of human genomic DNA from small samples for genotype analysis. This could 
be important both for preserving or "immortalizing" small amounts of rare DNA 
samples, and for more general use in clinical applications as a way to avoid removing 
large amounts of blood from patients to obtam DNA for genotyping. Finally, this 
amplification system provides a very sensitive detection system for DNA contamination 
in samples, for example in enzymes that have been purified under good manufacturing 
protocol (GMP) for use in human therapeutics. 

One way in which the present invention is distinguished from other exponential 
procedures for amplification is that it does not require the use of exogenous 
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oligonucleotide primers. Exogenous oligonucleotide primers refer to small nucleic acid 
molecules that are generally in the range of 6 to 50 bases in length, although longer 
primers can work. They are normally synthesized by chemical as opposed to enzymatic 
methods. The primers are complementary to certain regions of the template and are 
5 utilized by the polymerase to initiate DNA synthesis: The region of the template that is 
amplified is defined by the primers utilized. Primers are required in the present 
invention, however they are made by one of the components of the system, the primase, 
e g', gene 4 protein, rather than being synthesized in advance and then added to the 
system. The addition of primers to the system of the present invention has little effect on 

10 increasing the rate of DNA synthesis (generally less than two-fold), as the system works 
very well even without exogenous primers. Thus, even if the replication system of the 
present invention contains exogenous primers, there is still substantial DNA synthesis 
that is exponential in nature that is independent of the primers. One of ordinary skill in 
the art would readily be able to distinguish DNA synthesis carried out by the claimed 

15 method fi'om exogenously added primer dependent synthesis. In the situation where 
DNA synthesis is dependent on specific exogenously added primers, a new set of 
primers will be required for each new DNA template amplified. In the situation where 
DNA synthesis is dependent on the exogenous addition of random primers (e. g. random 
hexamers) the DNA synthesis will not be exponential. This invention describes a system 

20 whereby DNA synthesis is exponential even in the absence of any exogenously added 
primers. 

The template DNA used in this invention also does not need have a terminal 
protein bound to either 5' end. A terminal protein is a protein like the type used in the 
bacteriophage 0 29 replication system, eg., phage 029 the gene 3 protein (Salas, Ann. 
25 Rev. Biochem. 60:39, 1991). Such a terminal protein may be present but is not 
fimctionally required for the amplification system to work. 

The term "constant temperature" refers to an amplification reaction that is carried 
out under isothermal conditions as opposed to thermocycling conditions. Thermocycling 
conditions are used by polymerase chain reaction methods in order to denature the DNA 
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and anneal new primers after each cycle. Constant temperature procedures rely on other 
methods to denature the DNA, such as the strand displacement ability of some 
polymerases or of DNA helicases that act as accessory proteins for some DNA 
polymerases. Thus, the term "constant temperature" does not mean that no temperature 
fluctuation occurs, but rather indicates that the temperature variation during the 
amplification process is not sufficiently great to provide the predominant mechanism to 
denature product/template hybrids. Preferably the constant temperature is less than 
60»C, more preferably less than 50«. still more preferably less than 45«, and can even be 
less than 40*'C. 

The constant temperature amplification is carried out "/« vitro", meaning that the 
reaction is not carried out in cells. Generally the amplification is performed using 
purified or at least substantially purified proteins. 

In a preferred embodiment the DNA polymerase is from a bacteriophage. Further 
preferred is that the DNA polymerase is from bacteriophage T7. An even further 
preferred embodiment is that the DNA polymerase is a mixture of enzymes including 
both the wild-type DNA polymerase from bacteriophage T7 as well as a mutant T7 DNA 
polymerase that lacks its 3' to 5' exonuclease activity. It is also preferred that the 
reaction mixture contains a helicase or a primase. The helicase and primase are 
preferably the gene 4 protein of bacteriophage T7, most preferably the 63-kDa T7 gene 4 
protein that encodes the helicase/primase complex. Also preferred in the reaction 
mixture is a single-stranded DNA binding protein, preferably a single-stranded DNA 
binding protein fropi Escherichia coli. 

Applicant has surprisingly found that such a combination of proteins results in an 
unexpectedly large amount of DNA synthesis, and that the kinetics of this DNA 
synthesis is exponential rather than linear in nature. By exponential it is meant that at 
some period of time during the reaction the rate of DNA synthesis increases. In other 
words, the amount of DNA synthesized at a particular time will be greater than twice the 
amount of DNA synthesized at half the time. For example, if the amount of DNA 
synthesized after 20 minutes is ten times the amount of DNA synthesized after 1 0 
minutes, then the kinetics of DNA synthesis is exponential. On the other hand, if the 
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amount of DNA synthesized after 20 minutes is only twice the amount of DNA 
synthesized after 10 minutes, then the kinetics of DNA synthesis is linear. In order to 
test for whether DNA synthesis is linear or exponential, a small amount of radioactive 
deoxynucleoside triphosphate can be added to the mixture; e.g. example, one million 
cpm of [a-^^P]dATP in a 50 nl reaction, or a specific activity of 40 cpm per pmol of 
dATP if the dATP concentration is 500 ^M. Once the reaction is initiated by the 
addition of the enzyme mixture, aliquots are removed at varying times and the amount of 
DNA synthesized is detennined. as described in Example 1. For example. 5 m aliquots 
could be removed at 0. 1. 2. 4. 8. 16 and 32 min (although the.amount of each aliquot 
removed and the times can vary). The amount of DNA synthesized at the different times 
IS plotted as a function of the time of the reaction. If DNA synthesis is exponential, then 
the shape of the plot will be sigmoidal with the slope of the cui^e increasing over some 
portion of the time interval. A reaction will be exponential only for a portion of the time; 
the dNTPs will be rapidly used and the rate of new DNA synthesis will then rapidly 
decrease. Thus to determine if a reaction is exponential it is important to inspect the 
kinetics of DNA synthesis over a broad range of time (for example, from 0 to 30 min), 
and determine if the rate of DNA synthesis increases over any period of the time of the 
reaction. Also, the total amount of DNA synthesized is quite large, typically at least 10- 
fold more than the amount of template added to the reaction mixture, and preferably is at 
least 1 00-fold greater, or at least 1 000-fold greater, or at least 1 0,000-fold greater, or at 
least 100,000-fold greater, or at least 1.000.000-fold greater, or at least 10.000.000-foId 
greater or even more. Thus, one unique feature of the present invention is that synthesis 
is both exponential and results in the production of a large amount of DNA. e.g. 10-100 
times or more greater than the amount of template. Furthermore as the present invention 
does not require the use of exogenously added primers specific for each template being 
ampHfied, it provides a generalized amplification method not limited to the region of the 
DNA template delineated by primers. 

Another very effective method of monitoring the amount of DNA synthesized 
using the isothermal exponential amplification system is to use fluorescent probes or 
fluorescent dyes such as SYBR Green U (Molecular Probes, Eugene, OR) to 
continuously report the amount of DNA in the reaction in real time, The basic principle 
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of this method has been used successfully to detemiine the initial concentrations of 
specific RNAs and DNAs in a PCR reaction (Gibson et al, Genome Research 6:995, 
1996). Fluorescence can be monitored at intervals of 15 seconds using fluorescence- 
based real-time PCR instruments such as the PE Biosystems 5700 (PE Biosystems, 
5 Foster City, CA), the Roche LightCycler (Indianapolis, IN), or fluorescence microtiter 
plate readCTS that can maintain a constant temperature and can carry out kinetic 
measurements such as the SPECTRAmax (Molecular Devices, Sunnyvale, CA). 

A preferred embodiment for amplification of a plasmid template utilizes a 
polymerase in the reaction mixture such that the production of amplified product does 

10 not require exogenously-added oligonucleotide primers, the amplification is exponential, 
and the amoimt of amplified product is at least 10-fold greater than the amoimt of 
plasmid template DNA put into the reaction mixture. More preferably the amplified 
product is at least 100-fold greater or 1,000-fold greater, even more preferably at least 
10,000-fold greater or 100,000-fold greater, and most preferably at least 1,000,000-fold 

15 greateror 10,000,000-fold greater. 

The present invention also concerns the use of other reagents that further enhance 
amplification carried out by a core set of enzymes (the DNA polymerase, the helicase, 
the primase and the single-stranded binding protein). Thus, in further preferred 
embodiments the amplification reaction mixture also includes one or more of the 

20 following: a single-stranded binding protein of bacteriophage T7 (gene 2.5 protein), a 
nucleoside diphosphokinase such as thiat from E, coli, inorganic pyrophosphatase such as 
that from E xoli, an ATP regeneration system such as the combination of creatine 
kinase and phosphocreatine, a 5' to 3' exonuclease, preferably the 5* to 3' exonuclease is 
the bacteriophage T7 gene 6 exonuclease, and a ligase, preferably the ligase is the 

25 bacteriophage T7 DNA ligase. In another preferred embodiment the invention features 
the addition of chemical additives to the reaction mixture that increase the efficiency of 
the amplification reaction. Three examples of such additives are potassium glutamate, 
DMSO, and dextran polymer. 

In another preferred embodiment the invention features the treatment of the 

30 enzymes prior to inclusion in the reaction mixture with UV irradiation in order to reduce 
the amplification that arises from the contaminating DNA in the enzyme preparations. 
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Preferably the ultraviolet light is a dose from 10 to 1000 ^W/cm^ or preferably 100 to 
1000 ^lW/cm^ for from 15 sec to 15 min. 

In other preferred embodiments, the amplification of the DNA is exponential, the 
DNA molecule to be amplified can range in size from 1,000 bp, to 2,000 bp, 3,000 bp, 
5 5,000 bp, 10,000 bp, or even larger, up to a complete eukaryotic genome. Both single- 
stranded and double-stranded DNAs are amplified efficiently. When the DNA fragments 
being amplified are less than one thousand bases, they can be very poor substrates for 
amplification unless they are circular. Amplification of the DNA molecule is such that ' 
the amount of amplified product is preferably at least about 10-fold amplified, more 

10 preferably the amplified product is 100-fold greater, or at least 1000-fold greater, or at 
least 10,000-fold greater, or at least 100.000-fold greater, or at least 1,000,000-fold 
greater, or at least 10,000,000-fold greater or even more than the DNA put into the 
reaction mixture. The constant temperature at which the amplification reaction is carried 
out is preferably less than 60 °C. more preferably less than 45 «C. and more preferably at 

15 37 "C. 

In yet another preferred embodiment the reaction mixture contains ATP and CTP 
in addition to the 4 dNTPs. 

A preferred source of the DNA molecule to be amplified is lysed E. coli cells and 
the DNA is either a plasmid, a bacterial artificial chromosome (BAC) or phage DNA. 
20 Preferably, the added plasmid DNA template is less than 1 00 ng and the reaction mixture 
is 10 to 200^1. 

The present method of amplification is especially usefiil to prepare a DNA 
molecule for use in a DNA sequencing reaction. Preferably, the reaction mixture 
containing the amplified DNA is treated with a phosphatase to remove dNTPs that may 

25 be present prior to use in a sequencing reaction. A preferred phosphatase is from arctic 
shrimp. The sequencing reaction preferably uses fluorescent primers or fluorescent 
dideoxynucleotides. A preferred embodiment for preparation of a DNA molecule for use 
in a sequencing reaction utilizes a polymerase in a reaction mixture such that the 
production of amplified product does not require exogenously-added oligonucleotide 

30 primers, the amplification is exponential, and the amount of amplified product is at least 
10-fold greater than the amount of template DNA put into the mixture. More preferably 
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the amplified product is 100-fold greater, or at least 1000-fold greater, or at least 10,000- 
fold greater, or at least 100,000-fold greater, or at least 1,000,000-fold greater, or at least 
1 0,000,000-fold greater or even more than the DNA put into the reaction mixture. 
In another embodiment, the invention features a method of isolating plasmid, 
5 phage, or bacterial artificial chromosomes (BAC) fi"om bacterial cells, such as fi-om E. 
coll cells and then amplifying that DNA in a reaction mixture that does not require 
exogenous oligonucleotide primers. The bacterial cells are lysed to form a lysate which 
is then added to the amplification reaction mixture. Lysis can be carried out by any 
method known to those who practice the art, such as the use of lysozyme or detergents. 

10 A preferred embodiment for amplifying DNA directly fi-om a bacterial cell utilizes a 
polymerase in a reaction mixture such that the production of amplified product does not 
require exogenously-added oligonucleotide primers, the amplification is exponential, and 
the amount of amplified product is at least 10-fold greater than the amount of DNA in 
the lysate that was put into the reaction mixture. More preferably the amplified product 

15 is 100-fold greater, or at least 1000-fold greater, or at least 10,000-fold greater, or at least 
100,000-fold greater, or at least 1,000,000-fold greater, or at least 10,000,000-fold 
greater or even more. 

Other types of cells can also be used. In preferred embodiments, the cells are 
eukaryotic microbial cells, e.g., yeast cells, fungal cells or multi-nucleate structure, or 

20 mammalian cells, e.g., human cells. 

In another embodiment the invention features a method of amplifying genomic 
DNA in order to preserve small amounts of DNA for the purposes of long-term storage. 
A preferred embodiment is the amplification of human genomic DNA firom a small 
sample firom a patient (e.g. a blood sample or cheek swab) that can then be used for 

25 genotype analysis. 

In another embodiment the invention features a method of deteraiining the 
amount of contaminating DNA in a sample by incubating the sample potentially 
containing contaminating DNA with a reaction mixture comprising a DNA polymerase 
such that if the contaminating DNA is present the DNA molecule is amplified to produce 

30 amplified product. At least one pre-determined amount of control DNA is incubated 
with the same reaction mixture to produce amplified control product. The amount of 
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amplified product in the sample is compared with the amount of amplified control DNA 
as an indication of the amount of contaminating DNA in the sample. Alternatively, the 
rate of DNA synthesis is monitored in real-time using a probe that fluoresces only when 
bound to DNA; the greater the amount of input DNA, the shorter the lag period before 
5 the exponential phase of DNA synthesis can be observed (see Example 4). 

Contaminating DNA refers to any DNA that may be contaminating a protein 
preparation. In general it is most likely to arise fi-om the host organism in which the 
protein was obtained, and not purified adequately to remove it, or could be acquired 
during the purification of the protein fi-om columns or reagents. Thus, the most likely 
10 source of contaminating DNA is E. coli^ if the protein is a recombinant protein produced 
in E. coli, or it is human DNA if the protein is a recombinant protein produced in human 
tissue culture cells. 

By "predetermined amount of control DNA" is meant an amount of DNA such 
as a supercoiled plasmid (eg., pUC18 or pUC19) for example in the range of one 

1 5 femtogram to one nanogram in a 50 ul reaction. The synthesis obtained using this 

amount of DNA would be compared to an analogous reaction that was carried out in the 
absence of added DNA. The amount of synthesis at varying times specific for the added 
plasmid DNA would be determined by subtracting the amount of DNA synthesized in 
the absence of any DNA from that obtained in the presence of plasmid DNA. In fiirther 

20 preferred embodiments the DNA polymerase is fi-om bacteriophage T7, the reaction 

mixture fiirther comprises a helicase, a primase and a single stranded binding protein, the 
control DNA is amplified at least about 10-fold, the reaction does not require 
exogenously-added oligonucleotide primers, the reaction mixture is carried out at a 
constant temperature, the control DNA is amplified at least 10-fold, and the kinetics of 

25 amplification of the control DNA is exponential. 

In another aspect, the invention features a kit for amplification. The kit preferably 
contains a DNA polymerase, a primase, a helicase and a single-stranded binding protein. 
In fiirther preferred embodiments, the components of the kit comprise those 
independently selected fi"om the group consisting of a T7 DNA polymerase, the gene 4 

30 protein fi^om bacteriophage (the primase and the helicase), a DNA polymerase mixture of 
a wild-type T7 DNA polymerase and a T7 DNA polymerase modified to have reduced 3' 
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to 5' exonuclease activity, and the single-stranded binding protein from E. coli. More 
preferably, these components, a T7 DNA polymerase, the gene 4 protein from 
bacteriophage (the primase and the helicase), a DNA polymerase mixture of a wild-type 
T7 DNA polymerase and a T7 DNA polymerase modified to have reduced 3' to 5' 
5 exonuclease activity, and the single-stranded binding protein from £. coli, are selected as 
a group. Still more preferably, the gene 4 protein is the 63-kDa form of the protein. The 
kit can also contain one or more of the following components: a nucleoside 
diphosphokinase, an inorganic pyrophosphatase, an ATP regeneration system, preferably 
consisting of phosphocreatine and creatine kinase, a T7 gene 6 exonuclease, a T7 DNA 
10 ligase (gene 1.3 protein) and a single-stranded DNA binding protein of bacteriophage T7 
(gene 2.5 protein). 

In a further aspect the invention features a method of purifying the T7 
helicase/primase (63-kDa gene 4 protein) by overproducing the protein in an E. coli cell 
preferably along with a carboxy terminal fragment of the T7 gene 4 protein, preferably 
15 the peptide comprises the carboxy terminus of the protein after residue 221 (glutamine), 
more preferably the peptide comprises the carboxy terminus of the protein between 
residues 260 (tryptophan) and residue 280 (serine), even more preferably the peptide 
initiates at residue 271 (arginine). This gene 4 protein peptide reduces the toxicity of the 
full-length gene 4 protein to the cells and increases the expression level of the full-length 
20 geneby at least 10-fold. 

In another embodiment the amplification reaction is carried out in a solid matrix 
such as agarose. Under these conditions, in the presence of a fluorescent dye such as 
SYBR Green n (Molecular Probes. Inc.. Eugene, OR) the amplification of single DNA 
molecules can be observed in a fluorescent microscope as focal centers of fluorescence 
25 that develop with time and are dependent on added DNA. In preferred embodiment, this 
amplification in a solid support is used to generate a library of individual "subclones" of 
DNA molecules, without having to passage the DNA through E. coli cells. . More 
preferably these DNA clones are used as templates for DNA sequencing reactions, either 
by isolating individual clones and amplifying each in solution, or by carrying out the 
30 sequencing reaction using fluorescent dideoxy tenminators on all the clones together 
simultaneously on the agarose support. This latter approach would allow a tremendous 
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reduction in the amount (and thus the cost) of fluorescent sequencing reagents. Recently, 
Mitra and Church (Nucleic Acids Research 27:e34, 1999) have described a PCR-based 
system to amplify and detect individual DNA molecules embedded in a polyacrylamide 
support. 

5 The invention also provides a method for sequencing a DNA sequence by 

providing amplified DNA, that has been amplified using the constant temperature 
amplification method described above. The resulting products are sequenced by any of 
the conventional methods, such as dideoxy termination methods, preferably in an 
automated sequencer. Preferably the amplification is carried out in a solid matrix, 
10 preferably a gel such as an agarose or polyacrylamide gel. Preferably a circular nucleic 
acid construct with primer sites suitable for extension through the sequence of interest is 
utilized, e.g., as described in Example 10. Thus, the invention provides an efficient 
method of sequencing nucleic acids. 

As indicated above, the source of the template sequence to be amplified can be 
15 fi-om any of a variety of sources, including, for example, nucleic acid molecules &om 
blood fi-om human or non-human, cheek swabs, other tissue samples, microbial cells 
such as bacterial cells, viruses, plasmids or other cloning vector sequences, or other 
purified sequences from any source including synthetic sequences. 

Other features and advantages of the invention will be apparent from the 
20 following description of the preferred embodiments thereof and firom the claims. 

All articles, publications and patents cited in this application are hereby 
incorporated by reference, in their entirety. 

PRIEF DESCRIPTION QF THE DRAWINQ 

Fig. 1 represents the nucleotide sequence of pGP4A/E-l (SEQ. ID. NO. 1). 
25 Nucleotides 1-3 (ATG) correspond to the start codon for the 63-kDa gene 4 protein, 
while nucleotides 1699-1701 (TGA) correspond to its termination codon. Nucleotides 
2785-2787 correspond to the start codon for the truncated gene 4 protein beginning at 
residue 271 (encoded for by the codon at nucleotides 2785-2787), while nucleotides 
3670-3672 (TGA) correspond to the termination codon for this truncated gene. 
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DETAILED DRS CRJPTION OF THE PREFERRED EMBODTMF.NTS 

The following description and examples are provided for further illustrating 
various aspects and embodiments of the present invention and are in no way intended to 
be limiting in scope. 

DNA polvmerase 

In general a polymerase for use in this invention is one that cairies out extensive 
DNA synthesis on both strands of a DNA template, with the synthesized DNA in turn 
being capable of being used as a template for new DNA synthesis. "This results in an 
exponential increase in the amount of DNA synthesized with time. Many replicative 
DNA polymerases have accessory proteins such as processivity factors, helicases, 
primases and DNA binding proteins that are specific for each DNA polymerase (for 
review, see Romberg and Baker, DNA Replication, Freeman and Co., New York, 1992). 
Many such combinations will result in efficient DNA synthesis. The bacteriophage T7 
replication system provides one example of a polymerase and accessory proteins. Those 
of skill in the art will appreciate that combinations of polymerases and accessory proteins 
from other systems (e.g. bacteriophage T4 orE. colt) will also be useful in the present 
method of amplification. Such polymerases and accessory proteins can be used with all 
components being from one system, or can be used in other combinations with 
functionally compatible proteins from two or more different systems being utilized. 
Those of skill in the art would be able to determine such combinations of polymerases 
and accessory based on the Examples and descriptions provided. 

In the T7 DNA replication system, a combination of two forms of the T7 DNA 
polymerase result in the most- efficient DNA synthesis. One is the unmodified or native 
T7 DNA polymerase that has high 3' to 5' exonuclease activity (Tabor et ai, J. Biol. 
Chem, 262:16212, 1987). The other is a form of T7 DNA polymerase that is deficient in 
its 3' to 5' exonuclease activity. An example of such a DNA polymerase is the A28 T7 
DNA polymerase, missing residues from lysine 1 18 to arginine 145 (Tabor and 
Richardson, J, Biol- Chem. 264:6647, 1989; U.S. Patent Nos. 4,942,130 and 4,946,786). 
Both these forms of T7 DNA polymerase have the processivity factor thioredoxin bound 
to them in a one-to-one complex. DNA synthesis is most efficient when the 
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exonuclease-deficient fonn is in excess over native T7 DNA polymerase; the optimum 
molar ratio is about 20:1, Native and exonuclease deficient polymerases are known for 
other replication systems, such as bacteriophage T4, bacteriophage 029, and E. coli 
DNA polymerases I, n and HI. 

DNAhelicase 

Most replicative DNA polymerases require a DNA helicase for strand 
displacement leading strand DNA synthesis. In each replication system the helicase 
interacts specifically with the DNA polymerase from that system. The energy for 
helicase activity is obtained by the hydrolysis of nucleoside triphosphates. 

The helicase of bacteriophage T7 is the gene 4 protein. Its preferned substrate for 
hydrolysis is dTTP. The phage makes two forms of the gene 4 protein of molecular 
weight 56,000 and 63,000; the two forms arise from two in-frame start codons. Efficient 
amplification requires the 63-kDa form of the gene 4 protein, since, as discussed below, 
this form also provides primase activity (Bernstein and Richardson, J. Biol. Chem. 
264:13066, 1989). Altered forms, substitutions, insertions, deletions, of the 63-kDa 
protein are also suitable for the present invention. One example is the 63-kDa gene 4 
protein in which the methionine at residue 64 is changed to a glycine (G4Ao64): 
(Mendelman et al, Vmc. Natl. Acad. Sci. TT.<!A 89:10638, 1992; Mendehnan et al, 
J.Biol. Chem, 268:27208, 1993). This form of the 63-kDa gene 4 protein is useful 
because the mutation prevents the initiation of synthesis of the 56-kDa gene 4 protein. 
All enzymatic properties of the G4Ac64 form of the gene 4 protein that have been 
examined are comparable to those of the wild-type 63-kDa gene 4 protein, including its 
use as a primase and helicase for amplification as described in the current invention. 

DNA primase 

In most replication systems, synthesis of the lagging strand is initiated from short 
oligoribonucleotide primers that are synthesized at specific sites by primases. Specific 
interactions between a primase and the DNA polymerase allow the DNA polymerase to 
initiate DNA synthesis from the oUgoribonucleotide resulting in the synthesis of the 
lagging strand. 

A unique primase mechanism is found in tfie terminal protein exemplified by 
adenovirus and bacteriophage 029 (Salas, Ann. Rev. Biochem. 60:39, 1991). In these 
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organisms, duplex linear DNA is replicated by the covalent binding of the terminal 
protein to each 5* end, followed by initiation of DNA synthesis from a mononucleotide 
bound to the terminal protein by a DNA polymerase. While this system can provide a 
very efficient form of amplification of the phage genomes such as 029 DNA (Blanco et 
5 fl/., Proc. Natl Acad. Sci. USA 91 :12198, 1994), the nature of this priming mechanism 
makes it a difficult system to modify for general amplification of DNAs. 

In the T7 replication system, the gene 4 protein, in addition to being a helicase, is 
also the primase (Bernstein and Richardson, Proc. Natl Acad. USA 85:396,1988; 
Bernstein and Richardson, J. Biol. Chem. 264:13066, 1989). Only the 63-kDa form of 

10 the gene 4 protein has primase activity. At specific pentanucleotide recognition sites, the 
63-kDa gene 4 protein synthesizes tetraribonucleotides that are used as primers by T7 
DNA polymerase to initiate lagging strand DNA synthesis. The synthesis of RNA 
primers by 63-kDa gene 4 protein requires ATP and CTP. However, the addition of 
exogenous short oligonucleotides (7 bases or less) can be used by the gene 4 protein to 

1 5 initiate DNA synthesis primers by T7 DNA polymerase (Scherzinger et ai , Nucleic 

Acids Research 4:4151, 1977). The 63-kDa gene 4 protein is essential for amplification 
using the T7 replication system. While amplification is most efficient when ATP and 
CTP are present, it is possible that short oligonucleotides could be added to the reaction 
mixture to provide priming for amplification in the absence of ATP and CTP. While the 

20 native T7 DNA polymerase is capable of interacting with the 63-kDa gene 4 protein to 
initiate DNA synthesis from RNA primers, the A28 T7 DNA polymerase (missing 
residues 1 18 to 145) does so much less efficiently. This may be one explanation why a 
small amount of native T7 DNA polymerase is critical for efficient amplification. 

Single-stranded DNA binding protein 

25 Most replication systems require single-stranded DNA binding proteins. They 

serve a number of roles (for review, see Romberg and Baker, DNA Replication, Freeman 
and Co., New York, 1992). For example, they remove secondary structure from single- 
stranded DNA to allow efficient DNA synthesis. 

The single-stranded DNA binding protein (ssb) from E. coli has been extensively 

30 characterized. It has a dramatic effect on improving the processivity of T7 DNA 
polymerase, particularly at temperatures below 30"* C (Tabor et al, J. Biol. Chem, 
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262:16212, 1987). A low level of ssb is required for efficient amplification using the T7 
replication proteins described in this invention, Optimally, the amount of ssb for a 50 ^1 
reaction is from 0.01 to 1 fig. 

Bacteriophage T7 also encodes its own single-stranded DNA binding protein, the 
gene 2.5 protein (Kim et aL J. Biol. Chem. 267:15022, 1992) The presence of this 
protein is less important for amplification than is the E. coli single-stranded' DNA 
binding protein. However, it does stimulate the rate of DNA synthesis several-fold at a 
concentration of fi-om 0.05 to 5 |ig in a 50 ^il reaction containing the core amplification 
components. 

Nucleoside Diohosnhokinase 

Nucleoside diphosphokinase rapidly transfers the terminal phosphate fi"om a 
nucleoside triphosphate to a nucleoside diphosphate. It is relatively nonspecific for the 
nucleoside, recognizing all four ribo- and deoxyribonucleosides. Thus it efficiently 
equilibrates the ratio of nucleoside diphosphates and nucleoside triphosphates among all 
the nucleotides in the mixture. This enzyme can increase the amount of DNA synthesis 
if one of the required nucleoside triphosphates is preferentially hydrolyzed during the 
reaction. 

In the bacteriophage T7 replication system, the T7 helicase (gene 4 protein) 
hydrolyzes dTTP to dTDP to provide energy (Matson and Richardson, J. BioL Chem. 
258:14009, 1983). Thus the concentration of dTTP will decrease disproportionately 
during DNA synthesis. The addition of nucleoside diphosphokinase will re-equilibrate 
the level of triphosphates of all the nucleosides present. This results in an increase of 
several-fold in the amount of DNA synthesized. Nucleoside diphosphokinase fi-om 
Baker's Yeast can be purchased fi-om Sigma Chemical Co. (St. Louis, MO) and the 
overproduction and purification of nucleoside diphosphokinase from E, coli has been 
described (Abnaula et al IBact. 177:2524, 1995). Other nucleoside diphosphokinases 
are known to those who practice the art and are useful in the present invention. 

Inorganic Pyrophosphatase 

During a DNA synthesis reaction, inorganic pyrophosphate will accumulate as a 
product of the reaction. If the concentration becomes too high, it can reduce the amount 
of DNA synthesis due to product inhibition. This can be prevented by the addition of 
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inorganic pyrophosphatase. Yeast inorganic pyrophosphatase can be purchased from 
Sigma Chemical Co.( St. Louis, MO). Other inorganic pyrophosphatases are known to 
those who practice the art and are useful in the presait invention. 

- ATP-reeeneratinp system 

During DNA synthesis in many DNA replication systems, some of the 
deoxynucleoside triphosphates will be degraded to deoxynucleoside diphosphates due to 
hydrolysis by the helicase present. This can be minimized by the use of an ATP 
regeneration system which, in the presence of nucleoside diphosphokinase, will convert 
any nucleoside diphosphate in the reaction mixture to the triphosphate. In the T7 
replication system, the helicase very rapidly degrades dTTP to dTDP for energy. The 
presence of an ATP-regeneration system will increase the amount of nucleotides capable 
of serving as precursors for DNA synthesis. 

A number of ATP regeneration systems are available and have been widely used. 
For example, the combination of phosphocreatine and creatine kinase will push the 
equilibrium between ADP and ATP towards ATP, at the expense of the phosphocreatine. 
Both phosphocreatine and creatine kinase are available from Sigma Chemical Co. (St. 
Louis, MO). 

Joining lagging strand fragments 

In most DNA replication systems, the lagging strand is synthesized as a series of 
short, "Okazaki" fragments, that are initiated by a short RNA primer at each 5' end. In 
order to form a continuous strand, a 5' to 3' exonuclease is needed to remove the RNA 
primer, and then, after a DNA polymerase fills in the gap, a DNA ligase is needed to seal 
the nick. In the T7 DNA replication system, these fimctions are accomplished by the 
action of the gene 6 protein, a 5' to 3' exonuclease, and the gene 1.3 protein, a DNA 
ligase (Engler and Richardson, J. Biol. Chem. 258:1 1197, 1983). Purification of the 
gene 6 protein (Kerr and Sadowski, J. Biol. Chem. 247:305, 1 972; Engler and 
Richardson, J. Biol. Chem. 258: 11 197, 1983) and gene 1.3 protein (Doherty et al., L 
Biol. Chem. 271:1 1083, 1996) have been described. Exonucleases and ligases from 
other replication systems are known to those who practice the art and are useful in the 
present invention. 
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Removal of contaminating DNA from enzvmes 

Since the amplification system described in this invention is nonspecific, 
contaminating DNA will potentially be a serious source of background. This is 
particularly a problem in enzyme preparations in which the enzymes bind DNA, such as 
5 DNA polymerases, helicases and DNA binding proteins. Most DNA can be removed 
fi-om enzyme preparations during purification. For example, an anion exchange column 
such as DEAE will bind DNA much tighter than it binds proteins. 

The residual DNA present in enzyme preparations can be inactivated as a 
template for DNA synthesis by treatment of the enzymes with ultraviolet light. Effective 
10 doses of ultraviolet light are 10 to 1000 )iW/cml In the T7 replication system, treatment 
of the enzymes with an ultraviolet dose of 200 ^iW/cm^ for 2 min reduces the 
background synthesis due to contaminating DNA by 99%, while it reduces the activity of 
the enzymes by less than 20%. This dose of ultraviolet light is equivalent to the dose 
obtained using a standard, hand-held short-wave ultraviolet light source 10 cm fi'om the 
15 enzyme mixture. The enzyme mixture is kept on ice during the treatment with ultraviolet 
light to help maintain its activity. 

Removal o f unincorporated nucleoside triphosphates from reaction mixture 

In order to be used for DNA sequencing reactions, the template DNA added to 
the reactions can only have low levels of unincorporated dNTPs. An efficient method of 

20 removing dNTPs is to treat the amplified product with alkaline phosphatase such as that 
fi-om arctic shrimp (Amersham Pharmacia Biotech, Inc., Piscataway, NJ). This 
phosphatase is rapidly inactivated by treatment at elevated temperatures. Thus treatment 
of a 50 |al reaction mixture with 0.1 to 10 units of shrimp alkaline phosphatase at 20- 
ITC for 10 to 60 min, followed by treatment at 65-95''C for 10 to 60 min, results in 

25 template DNA free of dNTPs that can be added directly to a DNA sequencmg reaction 
such as the dye terminator reactions using AmpliTaq FS enzyme (Applied Biosystems, 
Foster City, CA) or ThermoSequenase enzyme (Amersham Pharmacia Biotech 
Inc.,Piscataway, N.J). 
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Reaction mixture for amplification using T7 replication proteins 

Reaction mixtures contain a buffer at a concentration of 5 mM to 50 mM at pH 
7.0 to 8.5, preferably pH 7.5; e.g., 20-40 mM Tris-glutamate, pH 7.5. 2 mM to 15 mM 
Mg^"*^ is required, preferably 5 to 10 mM; e.g., 9 mM MgC^. A reducing agent is 
5 present, such as 1 to 50 mM dithiothreitol or 2-mercaptoethanol; e.g. 5 mM 

dithiothreitol. The reaction mixture contains 50 to 200 mM potassium glutamate; e.g. 
100 mM potassium glutamate, which stimulates the reaction. In contrast chloride salts 
generally inhibit the reaction; if NaCl or KCl are present, preferably their concentration 
is below 50 mM. The 4 dNTPs are present at a concentration preferably between 20 and 

10 2,000 ^iM, most preferably at a concentration of 500-1000 fiM of each dNTP. rATP and 
rCTP are each present at a concentration between 20 and 2000 |im, most preferably at a 
concentration of 500 to 1000 \iM. Other chemicals may be added that enhance the 
amplification reaction. For example, DMSO between 0.5 and 8% gives an increased 
amount of DNA synthesis, most preferably a concentration of 4%. Similarly, a dextran 

15 such as Dextran T-10 or Dextran T-500 polysaccharide (Amersham Pharmacia Biotech. 
Inc., Piscataway, NJ) between 0.5 and 20% enhances the amount of DNA synthesis, 
most preferably a concentration of 5%. After the addition of the DNA to be amplified 
and the enzyme mixture containing at least some of the T7 replication enzymes described 
above, the reaction is allowed to proceed at a temperature between 10 C and 50** C, 

20 preferably 37^ C, for 10 to 60 min, preferably 20 min. The reaction can be stopped using 
standard procedures such as the addition of EDTA to a final concentration of 25 mM, or 
heating the mixture at 70° C for 20 minutes. 

Sequencing fi"om Single Molecule Amplification 
The amplification reaction can be used to provide DNA for sequencing. The 

25 amplification is carried out in a solid matrix such as agarose. Under these conditions, in 
the presence of a fluorescent dye such as SYBR Green II (Molecular Probes, Inc., 
Eugene, OR) the amplification of single DNA molecules can be observed in a 
fluorescent microscope as focal centers of fluorescence that develop with time and are 
dependent on added DNA. Low-melt agarose such as SeaPlaque Agarose (FMC 

30 Products, Rockland, ME) is preferred as the matrix, since the DNA can be easily 

removed fi-om the agarose when desired by heat. It may be helpfiil to use a combination 
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of agaroses or other matrices to optimize the stability of the individual foci in the solid 
support while allowing the DNA to be readily extracted when desired. 

If the single molecules being amplified are &om a large DNA fi-agment that has 
been digested with a restriction enzyme (e.g. EcoRI), and then ligated intramolecularly to 
form circular molecules, the collection of foci within the agarose support will correspond 
to a population of "subclones" of that original DNA molecule, comparable to subclones 
obtained more conventionally by transforming these molecules into E. coli and then 
plating the cells to obtain individual clones. The advantage of the approach described 
here is that the entire library of clones is generated without the need to transform the 
DNA into bacteria, and that each "clone" observed as a foci in agarose is pure DNA that 
can be used directly for sequencing or other applications. Recently, Mitra and Church 
(Nucleic Acids Research 27:e34, 1999) have described a PCR-based system to amplify 
and detect individual DNA molecules embedded in a polyacrylamide support. 

Several different approaches can be used to sequence the DNA amplified in the 
agarose support. One would be to pick the DNA fi-om each foci into a tube, heat at 70 "C 
to melt the agarose, and then add an aliquot to a new amplification reaction mixture to 
amplify the DNA in solution. This amplified DNA could then be used for DNA 
sequence analysis as described in Example 5. 

Alternatively, the DNA on the agarose support could be sequenced directly. The 
unincorporated nucleotides fi-om the amplification would first be removed by soaking the 
agarose in a suitable buffer, e.g., 10 mM Tris-HCl, pH 7.5, 1 mM EDTA. The DNA in 
the agarose would then be denatured by treatment with 10 mM NaOH for 5 min, 
followed by neutralization by several washes in 10 mM Tris-HCl, pH 7.5, 1 mM EDTA. 
The primer used for DNA sequencing would then be soaked into the agarose and allowed 
to annea:l to the denatured DNA. Excess primer would be removed by several washes in 
10 mM Tris-HCl, pH 7.5, 1 mM EDTA, and then the DNA sequencing reactions would 
be carried out by the addition of sequencing reaction buffer, fluorescent nucleotides (e.g. 
BigDye nucleotides fi-om PE Biosystems (Foster City, CA)), and a DNA polymerase 
such as AmpliTaq FS (PE Biosystems, Foster City, CA). The reaction would be initiated 
by heating the sample to 70 *C. After a 10 min reaction, the sample would be chilled to 
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4 °C, arid the product DNA would be denatured by treatment again with 10 mM NaOH 
for 5 min. Finally, the salt would be removed by extensive washing in water. 

The samples are now ready to be directly loaded onto a capillary DNA 
sequencing instrument. Injection of the capillaries would be a modification of that used 
in existing instruments such as the PEBiosystems 3700 Genetic Analyzer (Foster City, 
CA). One direct-load approach would be to insert a 100-1000 capillary array into the 
sample, heat to 70 ^^C to melt the agarose, and then apply a voltage to electrosmotically 
inject the DNA into each capillary. While only a subset of the capillaries would contain 
DNA from a single foci, and thus produce unique DNA sequence, the advantages of this 
procedure would be the ease of carrying out multiple sequencing reactions 
simultaneously and the low reagent cost of carrying out such parallel reactions. 



Example 1: Amplification of purified olasmid DNA using a minimal number of T7 
replication proteins 

The reaction mixture (45 ^1) contains 20 mM Tris-glutamate, pH 7.5, 9 mM 
MgCh, 6 mM dithiothreitol, 100 mM potassium glutamate, 3.5% dimethylsulfoxide 
(DMSO), 7% Dextran T-500 polysaccharide (Amersham Pharmacia Biotech, Inc., 
Piscataway, NJ), 550 jiM each dGTP, dATP, dTTP and dCTP, 330 |aM ATP, 440 ^M 
CTP, and DNA. The DNA samples range from 0.1 pg to 10 ng of supercoiled double- 
stranded plasmid DNA. 

Although circular DNA is a preferred substrate for amplification, linear DNA can 
also be used. The DNA can be either single- or double-stranded. In addition to plasmid 
DNA, other DNAs such as phage lambda DNA, phage Ml 3 DNA bacterial artificial 
chromosomal (BAC) DNA and genomic DNA (e.g., bacterial or human) are eiffective 
templates for amplification. Synthetic DNA such as oligonucleotides 100 nucleotides in 
length can be used if they are first ligated to form circular molecules. Data suggest that 
for cicular DNA molecules the initial stages of amplification involve a rolling circle 
mechanism. 

One method of initiating rolling circle DNA synthesis on a plasmid DNA from a 
specific site that has been described is to use the site-specific single-stranded 
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endonuclease gene n protein from filamentous phage such as M13 or fd. This protein 
recognizes a sequence of approximately 50 bp (the plus strand replication origin) and 
introduces a specific nick into one of the strands (Higashitani et al. Nucleic Acids 
Research 20:2685, 1992). While the wild-type gene H protein requires supercoiled DNA 
for activity, mutant proteins have been described {e.g. glycine 73 changed to alanine) that 
reduce this superhelicity requirement, so that the protein now recognizes relaxed duplex 
DNA (Higashitani et al. Nucleic Acids Resftarr.h 20:2685, 1992). The purification of 
the wild-type and mutant gene E proteins have been described (Greenstein and Horiuchi, 
I. Molec. Biol. 197:157, 1987; Higashitani et al. Nucleic Acids Resftarrh 20:2685, 
1992). It has been demonstrated that cleavage of supercoiled DNA with the gene 11 
protein results in a substrate that the T7 DNA polymerase and T7 gene 4 protein can use 
to initiate lagging strand DNA synthesis (Harth et al, Eur. J. Biochem. 1 19:663, 1981). 
It has also been demonstrated that these molecules promote rolling circle DNA synthesis 
using the replication proteins from bacteriophage T4 (Meyer et al , J. Biol. Chem. 
256:5810, 1981). This class of site-specific nicking proteins may be usefiil for 
stimulating initiation of the amplification process described in this invention. However, 
in the invention described here the inventors have shown that surprisingly nicking of the 
circular DNA is not necessary for efficient initiation of DNA synthesis; the combination 
of T7 gene 4 protein and T7 DNA polymerase are able to efficiently initiate and 
synthesize DNA on covalently supercoiled plasmid DNAs whether they are supercoiled 
or relaxed. 

For diagnostic characterization of the extent of DNA synthesis, one of the dNTPs 
can be radioactively labeled; for example, [^H] dTTP can be used at a specific activity of 
20 cpm/pmol. The synthesized DNA can also be detected by fluorescence or 
chemiluminescence by incorporating the appropriate modified nucleotides using standard 
techniques (Current Protocols in Molecular Biology, Ausubel et al, eds., John Wiley & 
Sons, Inc., 1996). ). In addition, the rate of DNA synthesis can be monitored in real time 
by the addition of a fluorescent probe to the reaction mixture (see Example 4). 

The enzyme mixture (5 nl per reaction) is prepared in a buffer containing 20 mM 
Tris-glutamate, pH 7.5, 10 mM 2-mercaptoethanol and 0.5 mg/'ml bovine serum albumin. 
5 \i\ of this mixture contains 1 ngof A28 T7 DNA polymerase, 0.03 ^g of native T7 
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DNA polymerase, 1 ng of T7 63-kDa gene 4 protein, and 0.3 ^g of ^. coli single- 
stranded DNA binding protein. A28 T7 DNA polymerase is a one-to-one mixture of 
E. coli thioredoxin and T7 gene 5 protein with a deletion of residues 1 18 to 145, as 
described in Tabor and Richardson, J. Biol. Chem. 264:6647, 1989, and U.S. Patents 
Nos. 4,942,130 and 4,946,786. It is the same as Sequenase Version 2.0 enzyme sold by 
Amersham Pharmacia Biotech, Inc. (Piscataway, NJ). Native T7 DNA polymerase is a 
one-to-one complex of thioredoxin and wild-type T7 gene 5 protein. It can be purchased 
from Amersham Pharmacia Biotech, Inc. (Piscataway, NJ) or New England Biolabs 
(Beverly, MA). T7 63-kDa gene 4 protein is the G4Ag64 protein described in 
Mendehnan et al., Proc. Natl. Ac ad. Sci. USA 89: 10638, 1992 and Mendeknan et al, L 
PiQl Cben?, 268:27208, 1993; it is the wild-type gene 4 63 kDa protein except that the 
methionine at residue 64 has been replaced with a glycine to prevent initiation of 
synthesis of the 56-kDa form of the gene 4 protein. E. coli single-stranded DNA binding 
protein (ssb) is overproduced and purified as described in Lohman et al., Biochemistrv 
25:21 (1986). It can be purchased from Amersham Pharmacia Biotech, Inc. ( 
Piscataway, NJ). 

In order to reduce the level of background DNA synthesis resulting from 
contaminating DNA, the en^e mixture is treated with ultraviolet light at a dose of 200 
HW/cm^ for 2 min on ice prior to adding to the reaction mixture. This dose corresponds 
to a distance of about 10 cm from a standard short-wave ultraviolet lamp. Times of 10 
sec to 15 min are also effective at reducing the amount of contaminating DNA that can 
be replicated without affecting significantly the activity of the enzymes. 

To initiate the amplification reaction, 5 jil of the enzyme mixture is added to the 
45 \il reaction mixture. The reaction is then allowed to proceed at 37* C for 20 min. The 
reaction can be carried out at a temperature of lO^C to 45''C, for 1 min to 2 hr. The 
reaction is stopped by the addition of 5 jxl of 200 mM EDTA. 

There are many published procedures to measure the amount of DNA synthesized 
during the reaction. For example, if [^HJdTTP is present, the amount of radioactivity 
incorporated into DNA can be measured by binding the reaction mixture to a DE81 filter 
disk, then washing the disk with 0.3 M ammonium formate to remove unincorporated 
nucleotides, and then measuring the amount of radioactivity retained on the disk using a 
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scintillation counter, as described in Kim et al, J. Biol. Chem. 267:15032 (1992). 
Alternatively, the DNA can be analyzed by agarose gel electrophoresis and staining with 
ethidium bromide, either with or without digestion with restriction enzymes, following 
standard procedures (Current Protocols in Molecular Biology. Ausubel et al., eds., John 
Wiley & Sons, Inc., 1996). 

This procedure should result in the incorporation of about 5% of the total dNTPs 
into DNA, or about 1.5 ^g of DNA synthesized in the 50 ^il reaction mixture. If the 
reaction mixture contains 10 pg of plasmid DNA. this corresponds to a 150,000-fold 
amplification. 



Example 2 : Amplification of purified pla.«; mid DNA usinp a T7 renlicatinn sy gteni 
optimized for maximum rate and amount of DNA svnthesi.s 

The rate and amount of DNA synthesis obtained using the conditions defined in 
Example 1 can be improved by using the modifications described below. All other 
1 5 components and conditions are identical to those outlined in Example 1 . 

Phosphocreatine (1 1 mM) is added to the 45 ^l reaction mixture. This in 
combination with creatine kinase provides an efficient ATP regeneration system. The 
phosphocreatine that is most effective is synthetic phosphocreatine, Sigma catalog 
number P6502 (Sigma Chemical Co., St. Louis, MO), dissolved in H2O to a 
20 concentration of 500 mM. 

The enzyme mixture contains at least some of the following six enzymes. These 
are in addition to the A28 T7 DNA polymerase, native T7 DNA polymerase, 63-kDa 
gene 4 protein and E. coli single-stranded DNA binding protein, which are present in the 
amounts defined in Example 1. 
25 (1) Creatine kinase, rabbit muscle (Boehringer catalog number 127566) 

(Boehringer Mannheim, Indianapolis, IN). A stock solution is prepared at 100 mg/ml in 
H2O and stored at -40' C. Each enzyme mixture (5 ^1) for a IX reaction (50 \i\ total 
volume) contains 2 \i% of creatine kinase. Creatine kinase, in conjunction with 
phosphocreatine and nucleoside diphosphokinase, provides an ATP regeneration system 
30 that converts nucleoside diphosphates that arise during the reaction to nucleoside 
triphosphates. 
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(2) Nucleoside diphosphokinase, Baker's yeast (Sigma catalog number N0379) 
(Sigma Chemical Co., St. Louis, MO). A, stock solution is prepared at 1 mg/ml in H2O 
and stored at -40° C. Each enzyme mixture (5 ^il) for a IX reaction (50 |il total volume) 
contains 50 ng of nucleoside diphosphokinase. Alternatively, nucleoside 
diphosphokinase can be overproduced and purified from E. coli (Almaula et al., J. Bact . 
177:2524, 1995); 50 ng of the E. coli enzyme is used for a IX reaction. Nucleoside 
diphosphokinase is added to maintain an equal ratio of all four dNTPs. 

(3) Inorganic pyrophosphatase, Baker's yeast (Sigma catalog number 11891) 
(Sigma Chemical Co., St. Louis, MO). A stock solution is prepared at 1 mg/ml in H2O 
and stored at -40° C. Each enzyme mixture (5 for a IX reaction (50 nl total volume) 
contains 20 ng of inorganic pyrophosphatase. Inorganic pyrophosphatase is added to 
degrade the pyrophosphate that will build up during polymerization and could inhibit 
DNA synthesis by product inhibition. 

(4) T7 single-stranded DNA binding protein (T7 gene 2.5 protein). T7 single- 
stranded DNA binding protein is overproduced and purified as described by Kim et al. 
fj. Biol. Chem. . 267:15022, 1992). Each enzyme mixture (5 pi) for a IX reaction (50 \il 
total volume) contains 1 \ig of T7 single-stranded DNA binding protein. The T7 single- 
stranded DNA binding protein will increase the rate of DNA synthesis by several-fold. 
One possible explanation for this is that the T7 single-stranded DNA binding protein is 
necessary to have efficient coupling of leading and lagging strand DNA synthesis by the 
11 DNA polymerase and helicase/primase complex (Park et al, J. Biol. Chem. 
273:5260, 1998: Lee et al. Mol. Cell. 1:1001. 1998). 

(5) 11 gene 6 exonuclease. The combination of T7 gene 6 exonuclease and T7 
DNA ligase (T7 gene 1 .3 protein) are used to remove the RNA primers from the 5' ends 
of the lagging strand fragments and then, after the gaps are filled in by the wild-type T7 
DNA polymerase, seal the resulting nicks. The T7 gene 6 exonuclease is purified as 
described in Kerr and Sadowski, J. Biol. Chem. 247:305, 1972 and Engler and 
Richardson, J. Biol. Chem. 258:1 1 197, 1983. Each enzyme mixture (5 for a IX 
reaction (50 \i\ total volume) contains 50 ng of T7 gene 6 exonuclease. 

(6) T7 DNA ligase (gene 1.3 protein). The T7 ligase is added to seal any nicks 
that are present in the synthesized DNA, in particular those that occur on the lagging 
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strand. T7 DNA ligase is overproduced and purified as described in Doherty et aL, L 
BioL Chem. 271:11083. 1996. Each enzyme mixture (5 |il) for a IX reaction (50 ^il total 
volume) contains 0.1 |ig of T7 DNA ligase. 

As in Example 1, the enzyme mixture is treated with an ultraviolet dose of 200 
5 |iW/cm^ for 2 min on ice prior to adding to the reaction mixture. The amplification 
reaction is then initiated by the addition of 5 ^il of the enzyme mixture to the 45 ^il 
reaction mixture. The reaction is allowed to proceed at 37° C for 20 min, and then 
stopped by the addition of 5 |il of 200 mM EDTA. The products are analyzed as 
described in Example 1 . 
10 This procedure should result in the incorporation of up to 50% of the total dNTPs 

into DNA, or about 15 ^g of DNA synthesized in the 50 ^1 reaction mixtxire. If the 
initial reaction mixture contains 10 pg of plasmid DNA, this corresponds to a 1,500,000- 
fold amplification. 

15 Example 3 : Assay to determine whether amplification is exponential 

Reactions as described in Examples 1 and 2 are carried out using varying 
amounts of plasmid DNA and for varying lengths of time. A radioactively labeled dNTP 
is used in order to determine the amount of DNA synthesis; for example, 50 cpm/pmole 
[^H]dTTP or [a-^^P]dATP. 100 ^il reactions are carried out in the absence of added 

20 DNA, and in the presence of 1, 10, 100 and 1,000 pg of supercoiled pUC18 plasmid 

DNA. Reactions are carried out at 37° C and 20 |il aliquots are removed at 5, 10, 15, 20 
and 30 min and stopped by the addition of 5 fil of 200 mM EDTA, pH 8.0. The amount 
of radioactivity incorporated into DNA is determined using standard methods. For 
example, the amount of radioactivity that can be precipitated by trichloracetic acid, 

25 which will precipitate DNA but not fi-ee nucleotide, can be determined as described in 
Tabor and Richardson, J. Biol. Chem. 264:6647, 1989. Alternatively, the amount of 
radioactivity that is retained by DE81 filter disks in the presence of 0.3 M ammonium 
fomiate, pH 8.0, can be determined (Kim et ai, J. Biol. Chem. 267:15032, 1992); DNA 
but not dNTPs is retained by the disks under these conditions. 

30 For each time point, the amount of DNA synthesized in the absence of added 

DNA is subtracted firom the amount synthesized in the presence of DNA. Up to 20 min, 
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the amount of DNA synthesized in the absence of added input DNA should be very low 
(for example, less than 1% the amount synthesized in the presence of 100 pg of added 
input DNA). 

For each amount of input DNA, the amount of DNA synthesized above the 
background level synthesized in the absence of input DNA is plotted as a function of the 
time of the reaction. If amplification is exponential, then the rate of synthesis of DNA 
will increase during some portion of the time course (i.e., the plot will be sigmoidal). 
For example, the amount of DNA synthesized after 15 min might be 50-fold higher than 
the amount synthesized after 1 0 min. 

Example 4: Real-time amplification assav using fluorescence 

A very effective method of monitoring the amount of DNA synthesized using the 
isothermal exponential amplification system is to use fluorescent probes to continuously 
report the amount of DNA in the reaction in real time. The basic principle of this 
method has been used successfully to determine the initial concentrations of specific 
RNAs and DNAs in a PGR reaction (Gibson et al. Genome Research 6:995, 1996). The 
higher the initial DNA concentration, the shorter the lag period before the exponential 
phase of DNA synthesis. To measure the amount of DNA synthesis in real time, 
reactions were carried out as described in Example 3 except that radioactivity was 
omitted and the fluorescent dye S YBR Green II (Molecular Probes, Eugene, OR) was 
added at a final concentration of 1/80,000 the stock solution. The dye was added after 
the reaction mixture was treated with UV light to destroy contaminating DNA. S YBR 
Green II has virtually no fluorescence on its own but fluoresces very strongly when 
bound to nucleic acid. At concentrations higher than 1/80,000 of the initial stock it is 
inhibitory to the amplification reaction described in Example 3. Other dyes that 
fluoresce specifically when bound to nucleic acid will also work in the present invention 
(e.g. SYBR Green I, Pico Green, Oligreen, SYTO 1 1, SYTO 12, SYTO 13, SYTO 14, 
SYTO 15, SYTO 16 (Molecular Probes, Eugene, OR), and SYBR Green (PE 
Biosystems, Foster City, CA)) or any other dye which shows significantly enhanced 
fluorescence when bound to DNA. Preferably, the significantly enhanced fluorescence 
refers to at least a 10-fold increase in fluorescence upon binding to DNA. More 
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preferably, DNA binding will result in at least a 30-fold, 100-fold, 300-fold or 1000-fold 
increase in fluorescence. 

In the example, reactions are carried out at 37 °C. Fluorescence is monitored 
once every 15 sec for 30 min. Exemplary instruments that can be used to measure 
5 • fluorescence include real-time PCR instruments such as the PE Biosystems 5700 (PE 
Biosystems, Foster City, CA) and the Roche LightCycler (Indianapolis, IN), and 
fluorescence microtiter plate readers that can maintain a constant temperature and can 
carry out kinetic measurements, such as the SPECTRAmax (Molecular Devices, 
Sunnyvale, CA). Ideally the SYBR Green 11 fluorescence is monitored using an 

10 excitation wavelength of 480 nM and an emission wavelength of 530 nM. 

In order to measure the DNA concentration in unknown samples, a standard 
curve is determined for each experiment by carrying out reactions either in the absence 
of DNA (blank) or in the presence of known concentrations of a standard DNA such as 
supercoiled pUC18 (e. g. ten-fold dilutions from 0.1 pg to 10 ng). A standard curve is 

15 determined by plotting the time required for fluorescence to reach half-maximum (or 
alternatively the time required for the relative fluorescence to reach some threshold 
value) versus the log of the initial concentration of DNA. This plot yields a straight line 
over at least three orders of magnitude of initial DNA concentration. A comparison of 
the values obtained using unknown samples to the standard curve allows one to 

20 extrapolate the amount of DNA present at the outset in each sample. 

pxample 5: Use of amplified DNA as a template for DNA sequencmg reactions 
The following example shows how the amplification reaction can be used to 
generate enough template from a small amount of DNA to be sequenced using 

25 conventional radioactive and nonradioactive dideoxy terminator sequencing strategies. 
While the example shown is starting from a small amount of plasmid DNA, this 
procedure will also work using other DNAs such as those from bacterial artificial 
chromosomes (BACs), phage lambda, phage M13, or genomic DNA (e,g., bacterial or 
human). Also, while the example shown uses purified DNA, much less pure samples, 

30 such as that from freshly lysed E. coli cells, can also be used. 
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Supercoiled pUC18 DNA (100 pg) is added to a 50 ^l reaction mixture as 
described in Example 2, and amplified at ST'C for 20 min. The reaction is stopped by 
heating the mixture at 70*C for 5 min. 0.5 units of arctic shrimp alkaline phosphatase 
(Amersham catalog number E70092) (Amersham Pharmacia Biotech, Inc., Piscataway, 
NJ) is added, and the mixture is incubated at 37°C for 30 min. The phosphatase is then 
inactivated by incubation at 80°C for 15 min. The amplified DNA can now be directly 
added to a DNA sequencing reaction. Other methods can also be used to purify the 
template DNA away fi-om unincorporated dNTPs. These include precipitating the DNA 
with ethanol or isopropanol, and purifying the DNA by binding it to a silica matrix, for 
example using a commercial kit such as the QIAquick PGR Purification Kit, product 
number 28104 (Qiagen, Inc., Valencia, CA). 

In order to sequence the amplified DNA using the PE Biosystems BigDye 
Terminators (catalog number 4303149 (PE Biosystems, Foster City, CA), 1 ^l of the 
amplified DNA (500 ng) that had been treated with arctic shrimp phosphatase was added 
to 8 ^l of the BigPyeReaction Mixture, 1.5 \il (3 pmoles) of the "universal" forward 
primer, and 9.5 \il of water. Cycle sequencing was carried out using 25 cycles consisting 
of 96 *C for 10 sec, 50 "C for 5 sec, and then 60 "C for 4 min. After the reaction was 
complete the unincorporated dye terminators were removed by gel filtration 
centrifiigation using Centri-Sep columns (Princeton Separations, Inc., Princeton, NJ) and 
the eluent was used directly for capillary sequence analysis on the PE Biosystems 310 
single-capillary Genetic Analyzer or the PE Biosystems 3700 96-capillary Genetic 
Analyzer. The accuracy of the sequence was analyzed by determining the number of 
bases that had a Phred score greater than 20 (Ewing et al., Genome Research 8:186, 
1998). In general, three times less amplified DNA was required compared with 
supercoiled plasmid DNA to produce DNA sequence of a given quality as analyzed by 
their relative Phred scores. This is probably due to the fact that primers will anneal more 
efficiently to the amplified DNA since it is linear compared with the supercoiled plasmid 
DNA. 

The amplified DNA can also be sequenced using other methods of labeling the 
Augments. For example, it can be used a template for reactions containing radioactively 
labeled dideoxyterminators; 2 ^1 of the DNA can be added directly to the 20 ^1 reaction 
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mixture from a ThemioSequenase radiolabeled terminator cycle sequencing kit, 
Amersham Pharmacia product number US 79750 (Amersham Phamiacia Biotech, Inc., 
Piscataway, NJ). The DNA can also be used for DNA sequencing techniques that use 
either radioactive or fluorescently labeled primers. 

Example 6: Amplificati on and sequencing of plasmid DNA directly from a bacterial 
colony 

The procedure below describes the amplification in vitro of DNA obtained from 
direct lysis of a bacterial colony, followed by DNA sequencing of the amplified DNA. 
While the example is using cells containing a high copy plasmid (pUC18), one skilled in 
the art will recognize that cells containing other types of DNAs, such as bacterial 
artificial chromosome (BAC) DNAs, could also be used in this procedure. 

Bacterial cells containing the plasmid to be amplified and sequenced, for example 
a pUC18 derivatiye, are plated on LB plates containing 50 ^igAnl ampicillin at 37*" C 
overnight. A single colony is placed in 500 |il of a solution containing 20 mM Tris-HCI, 
pH 7.5, 1 mM EDTA and 100 ^g lysozyme. After incubation at room temperature for 15 
min, the solution is centrifuged at 14,000 rpm for 10 min. 1 |al of the supernatant is then 
added to a 50 |il amplification reaction as described in Example 2. The resulting 
amplified DNA is then sequenced as described in Example 4. While the centrifiigation 
step removes genomic DNA and thus improves the purity of the sample, it is possible to 
avoid this step and use the lysed sample directly for amplification and sequencing. 

While the lysis method described here uses lysozyme, other methods could be 
used, such as the addition of nonionic detergents (e.g. Triton X-100 or Tween 20) or the 
expression of the T7 lysozyme gene (gene 3.5 protein) within the E. coli cell (pLysS and 
pLysE, Novagen, Inc., Madison, WI). It is possible that the method of lysis will have to 
be optimized for plasmids of varying size in order to minimize the amount of 
contaminating chromosomal DNA released. 

Example 7: BAC end seouencing by selective amplification of the ends of the insert 

In this example the ends of BAC clones are sequenced by first preferentially 
amplifying DNA fragments containing the ends of each insert. This is accomplished by 
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digesting the BAG DNA and then canying out a ligation in the presence of a splint that 
will result in circularization of only the two end fragments. Since the amplification 
system described in Example 2 preferentially amplifies circular DNA when the DNA 
fragments are less than several kbp. this circularization results in an enrichment of the 
end fragments in the final amplified mixture. (This is not necessarily the case for other 
amplification systems of the invention.) While the example below uses the vector 
pBELO-1 1 (Research Genetics, HuntsviUe, AL), which has a single Bae I site and no 
Mlu I sites, the strategy used can readily be applied to other BAG vectors by choosing 
the appropriate restriction enzymes and designing the appropriate splint oligonucleotides. 



pBELO-11 BAG DNA containing inserts is isolated from E. coli cells using 
standard procedures. 50 ng of the DNA is treated with the enzymes Bae I, Mlu I and T4 
DNA ligase (New England Biolabs, Beverly, MA) and a 50-fold molar excess of the 
following three oligonucleotides: BAC-1 (5'-CGCGGTAGAGGGAGGTGAA-3'), BAG- 
2 (5'-CGCGGTACACCGAGTTAAT-3') and BAG-3 (5'-GTGGGTGTAG-3'). BAG-1 
15 and BAG-3 will anneal to form a split that will result in circularization and ligation of 
one end of the pBELO-1 1 DNA, while BAG-2 and BAG-3 will anneal to form a split that 
will result in circularization and ligation of the other end of the pBELO-1 1 DNA. The 
reactions are carried out in 20 ^1 in 20 mM Tris-HCl, pH 7.5, 10 mM MgCh, 5 mM 
DTT, 20 nM S-adenosylmethionine and 500 ^M ATP at 37 °C for 60 min. 5 ^l of the 
20 ligated products are amplified as described in Example 2 in a 50 |il reaction. After 
removal of the unincoiporated nucleotides, the amplified DNA is sequenced using the 
universal T7 (TAATAGGAGTGAGTATAGGGGGA) or SP6 
(GATAGGATTTAGGTGAGAGTATAG) primers that anneal upstream of each of the 
two ends of the insert in pBELO-1 1 . 

25 

Example 8: Generic amplification of human genomic DNA 

A cheek swab is taken of a human subject and applied to FTA Gard (GIBGO 
BRL, Rockville, MD). The DNA bound to the paper is purified using FTA Purification 
Reagent as described by the distributor (GBBCO BRL, Rockville, MD). The DNA is 
30 eluted from the paper in 1 0 mM Tris-HGl, pH 7.5, 1 mM EDTA by heating at 80 °G for 



32 



249/107 
Patent 



5 min. The concentration of the eluted DNA is approximately 1 ng/^l. One ^1 of this 
DNA is amplified in a 50 ^il reaction as described in Example 2, incubating at 37 °C for 
20 min. The concentration of the DNA in the completed reaction is 0.5 fig/^il, a 25,000- 
fold amplification. Aliquots of this DNA can be used directly for genotyping usiiig PGR 
reactions; e.g. 1 ^1 in a 25 ^il PGR reaction. 

Example Detection o f contaminating DNA by amplification reaction 

In order to characterize the level of contamination of DNA in an unknown 
sample, reactions are carried out as described in Example 2 using varying amounts of the 
unknown sample. As a control, a standard series of reactions are also carried out both in 
the absence of DNA and in the presence of 1, 10, 100 and 1,000 pg of a standard 
supercoiled plasmid DNA such as pUC18. The reactions are carried out at 37°G for 15 
min. The reaction mixtures are carried out in the presence of 20 cpm/pmole [^H]TTP, 
and the amount of DNA synthesis in each sample is determined by measuring the 
amount of radioactivity taken up into DNA, as described in Example 1. Alternatively, 
DNA synthesis can be monitored in real time using a fluorescent probe (see Example 4). 
The amounts of DNA synthesized in the unknown samples are compared with the 
amounts of DNA synthesized in the standard reactions containing different initial 
concentrations of the known DNA in order to extrapolate the relative amount of 
contaminating DNA in the unknown sampled 

Example 10: Single molecule amplification in agarose 

In accordance with the description above, in this example, single DNA molecules 
are amplified ais individual foci embedded in a solid matrix, such as agarose. The 
amplified DNA is detected by fluorescence in the presence of the dye SYBR Green II 
(Molecular Probes, Eugene, OR). 

Reactions are carried out as described in Example 4. Two-fold concentrated 
reaction mixtures are prepared, and then diluted two-fold with a solution containing 
liquid 1% SeaPlaque agarose (FMG Products, Rockland, ME). The agarose is prepared 
in water and dissolved at 90 ^G, then cooled to 37 before mixing with the reaction 
mixtures. The reaction mixtures contain a final 1/80,000 X concentration of SYBR 
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Green II fluorescent dye, and varying amounts of supercoiled pUC18 DNA, ranging 
jfrom none up to 100,000 molecules per fil. After mixing the agarose and the reaction 
mixture, the mixtm-e is plated on a microscope slide, covered with a cover slip, and 
placed on ice to solidify the mixture. The slide is then heated at 37 ®C and the 
5 progression of foci is monitored using a fluorescence microscope, exciting at a 

wavelength of 480 nM and observing the emission at 530 nM. In the absence of added 
DNA, approximately 20 foci are observed on a one cm^ region, that correspond to the 
amplification of contaminating DNA. In the presence of increasing amounts of pUCl 8 
DNA, the number of foci increasing proportionally, and thus correspond to amplification 

10 of the added DNA. Each foci corresponds to the amplification of a single DNA 
molecule, and thus is "clonal". 

While the example below uses supercoiled pUC18 DNA as a template, one 
skilled in the art will recognize that other circular DNAs could be used. For example, if 
BAG DNA was digested with EcoRI, and then ligated, a population of circular 

15 fragments, or a "library", would be generated. If a splint oligonucleotide was ligated 
between the EcoRI sites, these could then serve as primer sites to sequence each insert in 
both directions. Using this strategy, all the amplified foci embedded on an agarose bed 
could be sequenced in a single dye-terminator sequencing reaction. Afler carrying out 
the sequencing reactions, the unincorporated dye terminators and the salts could be 

20 washed out of the agarose and each foci could be sequenced directly by injection into a 
capillary-based sequencing instrument such as the PE Biosystems 3700. This would 
allow one to generate a library of subclones of a large DNA fragment rapidly without the 
use of £. coli cells, and then to sequence all the fi:agments in a single sequencing 
reaction, that could be directly loaded onto a capillary sequencing instrument. This 

25 would represent a tremendous reduction in the scale of required reagents and human 
labor involved in cloning and sequencing fi-agments. 



Example 11 : Overproduction and purification of the 63-kDa T7 gene 4 protein 

A preferred T7 gene 4 protein used in this invention is the 63-kDa form refeired 
30 to as G4Ag64 in Mendelman et al, fProc. Natl. Acad. Sci. USA 89:10638, 1992) and 
Mendelman et al, (J. Biol. Chem. 268:27208, 1993). It is the wild-type 63-kDa gene 4 
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protein except that the methionine at residue 64 has been replaced with a glycine to 
prevent initiation of synthesis of the 56-kDa form of the gene 4 protein. Expression of 
the T7 gene 4 in E. coli is toxic to the cells and must be kept tightly repressed when 
uninduced. Furthermore, when induced, the toxicity prevents efficient overproduction of 
the protein. It is known that mutants of gene 4 protein such as those in the nucleotide 
binding site eliminate this toxicity and result in much greater overproduction of the 
protein (Notamicola and Richardson, J. Biol. Chem. 268:27198, 1993). This observation 
suggests that it is the potent TTPase activity of the wild-type gene 4 protein that is 
responsible for the toxicity. Thus one method to increase the amount of wild-type gene 4 
protein overproduced in cells is to coexpress another molecule that will bind to the wild- 
type gene 4 protein and inhibit its TTPase activity. Three possible classes of molecules 
that may accomplish this are protein, RNA and DNA aptamers (Cohen et al., Proc. Natl. 
Acad. Sci. UgA 95:14272, 1998; Famulok and Jenne, Curr. Ooin. Chem. Riol 2:320, 
1998), which can be selected for from libraries on the basis of either binding to wild-type 
gene 4 protein in vitro, or inhibit the toxicity of its production in vivo. 

An alternative approach is to coexpress a fragment of gene 4 that interferes with 
the activity of the wild-type gene 4 protein. The 295 residue carboxyl-terminal fragment 
of gene 4 protein from arginine 271 to the end of the gene has this property; in vitro it 
inhibits the wild-type gene 4 protein's TTPase activity, and in vivo it reduces the toxicity 
of the wild-type gene 4 and increases the overproduction of the wild-type gene 4 protein 
by at least lO-fold (Guo et al., J. Biol. Chem. 274:30303, 1999). It is likely that other 
carboxyl-terminal fragments of the gene 4 will have a similar effect. 

To overproduce the G4A64A form of the gene 4 protein, the plasmid pGP4A/4E-l 
whose sequence is shown in Figure 1 (SEQ. ID. NO. 1), was used. It expresses both the 
G4A64A gene 4 and the gene 4 fragment initiating at codon 271, both from T7 RNA 
polymerase promoters. The two genes are in tandem, with the G4A64A expressed 
upstream of the gene 4 fragment that starts at codon 271 (see Figure 1). pGP4A/4E-l 
was used to transform the^. coli cells BL21/DE3 (Novagen, Inc., Madison, WI) and 
selected for by plating on agar plates containing 50 ng/ml kanamycin. The cells were 
grown in LB media containing 50 jig/ml kanamycin at SO^C. When the cell density 
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reached A59o=l, the gene 4 proteins were induced by the addition of 0.5 mM PTG. 
After inducing for 3 hours at SO^^C, the cells were harvested and frozen at -80°C. 

The 63-kDa gene 4 protein was purified by the procedure described by 
Notamicola et al ( J, Biol. Chem. 270:20215, 1995). Briefly, the cells were lysed by the 
5 addition of lysozyme and by sonication. After the cell debris was removed by 
centrifiigation, the 63-kDa gene 4 protein was precipitated by the addition of 15% 
polyethylene glycol 4000. After resuspension of the precipitate, the gene 4 protein was 
purified by phosphocellulose chromatography. Finally, the pool of the fractions 
containing gene 4 protein from the phosphocellulose column was purified by ATP- 
10 agarose affinity chromatography. The gene 4 protein fragment present in the extract 
separates from the G4A64A gene 4 protein both in the PEG precipitation, the 
phosphocellulose chromatography, and the ATP affinity chromatography. From 10 liters 
of induced cell culture, 100 mg of pure G4A64A gene 4 protein were obtained. 
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Other embodiments within the scope of the following claims. 



